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Abstract 

Ant colony optimization (AGO) has been applied to the field of combinatorial optimization 
widely. But the study of convergence theory of AGO is rare under general condition. In this paper, 
the authors try to find the evidence to prove that entropy is related to the convergence of AGO, 
especially to the estimation of the minimum iteration number of convergence. Entropy is a new 
view point possibly to studying the AGO convergence under general condition. 
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I. INTRODUCTION 



AGO is a recently developed, population-based approach presented by M. Dorigo and A. 



ii,y,3- 



Ant System 



Colorni etc. al., it was inspired by the ants' foraEfing behavior in 1991 

nnn 

(AS) was first introduced in three different versions [1|, 12, l3[ , they were called ant-density, ant- 
quantity, and ant-cycle. Ant Colony System (ACS) has been introduced in 0, sl to improve 
the performance of AS. Later, AS and ACS developed into a unifying framework to solve 



combinatorial optimization problems 6, ?!, and the framework is often called Ant Colony 
Optimization (ACO). ACO has been applied to solve optimization problems [ll. I2I . such 

Quadratic Assignment Problem (QAP) jl^ . 
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and Data 



as Traveling Salesman Problem (TSP)[4 

Job-shop Scheduling Problem(JSP) 13|, Vehicle Routing Problem( VRP) 
Mining(DM) \r\ . The high performance of ACO and its wide application make it as famous 
as other optimization algorithms, such as Simulated Annealing (SA) sl. Tabu Search (TS) j^, 
Genetic Algorithms (GA)[lO|, and so on. 

The study of ACO theory is necessary but rare. W. J. Gutjahr studies the convergence of 
ACO under some conditions by Graph Theory jl8|. which is called Graph-Based Ant System 
(GBAS). GBAS maps a feasible solution of optimization problem to a route in a directed 
graph. T. Stiiezle and M. Dorigo proved the existence of the ACO convergence under two 
conditions, one is to only update the pheromone of the shortest route generated at each 
iteration step, the other is that the pheromone on all routes has lower bound [19]. J. H. 
Yoo analyzes the convergence of a kind of distributed ants routing algorithm by the method 



of artificial Intel 
Markov Process 
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■igence 

22]. Ding presents a hybric 
analyzes the convergence by Markov theory 



2l[ |. Sun analyzes the convergence of a simple ant algorithm by 
gorithm of ACO and genetic algorithm, and 
23|. H ou presents a special ACO algorithm and 

3]- 



proves its convergence by fixed-point theorem 

The ways of studying ACO convergence are rare, such as Markov theory. Graph Theory, 
and so on. And only the results with some constraint conditions are obtained currently, and 
the result with no constraint condition is still unknown. The motivation of this paper is to 
explore the way to study ACO convergence under no constraint condition. 
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II. FRAMEWORK OF AGO 



In the 1990s, AGO was introduced as a novel nature-inspired method for the solution of 
hard combinatorial optimization problems (Dorigo, 1992; Dorigo et al., 1996, 1999; Dorigo 
and Stiiezle, 2004). The inspiring source of AGO is the foraging behavior of real ants. When 
searching for food, ants initially explore the area surrounding their nest in a random manner. 
As soon as an ant finds a food source, it remembers the route passed by and carries some 
food back to the nest. During the return trip, the ant deposits pheromone on the ground. 
The deposited pheromone, guides other ants to the food source. And it has been shown 
(Goss et al., 1989), indirect communication among ants via pheromone trails enables them 
to find the shortest routes between their nest and food sources. 

The framework of AGO is shown in Algorithm 1, and it is applied to solve Travel 
Salesman Problem (TSP). Where TSP can be explained as follows: for a given set of cities, 
the task of TSP is to find the cheapest route of visiting all of the cities and returning to 
starting point, provided each city is only visited once. 

Algorithm 1 

Stepl. Initialization: Initialize pheromone of all edges among cities. And put m ants 
at different cities randomly. Pre- assign an iteration number iVcmax let t = 0, where t 
denotes the t — th iteration step. 



Step2.1. All ants select its next city according to the transition probabihty defined in 
formula (1), which is the probability that the k — th ant selecting the edge from i — th city 
to j — th city. 



, where allowedk denotes the set of cities that can be accessed by the k — th ant; Tij{t) is 
the pheromone value of the edge (i, j); is a local heuristic function defined as 



Step2. while(i < Nc^J 
{ 




(1) 
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, where dij is the distance between the i — th city and the j — th city; the parameters a 
and P determine the relative influence of the trail strength and the heuristic information 
respectively. 

Step2.2. After all ants finish their travels, all pheromone values Tij{t) are updated 
according to formula ([3]). 



Tijit + 1) = (1 - p) . T,,(t) + AT,,(t) (3) 

m 

An,{t) = Y,^r'^{t) (4) 

fc=i 



ih\ TiMTT^ if the k — th ant pass edge (i, j) , , 

I otherwise 

,where L^''\t) is the length of the route passed by the k — th ant; p is the persistence ratio of 
the trail (thus, (1 — p) corresponds to the evaporation ratio); Q denotes constant quantity 
of pheromone. 

Step2.3. Increase iteration number, i.e., t <— t + 1. 

} 

Step3. End procedure and select the route which has shortest length as the output. 



III. THE STATISTICAL FEATURE OF THE SOLUTIONS OF AGO 



A. Definition of Symbols 

AGO solving the problem of TSP is the model in this paper. Suppose the m ants are ai, 

02, , am- At the t — th iteration step, ant selects route rf^ and it has length Lf\ After 

all ants finish their t — th travels, there are totally amount of pheromone f-^^ = '^Tkj{t) 

(fc,i)GrW 

depositing on router rf\ where Tkj{t) denotes pheromone depositing at the edge {k,j) by 
all ants. 

Definition 1 (Pheromone Probability): 
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it) 



n 



it) 



it) 



(6) 



In formula ([6]), E/j*^ represents the sum of pheromone of all routes, pf represents the 

ratio of pheromone that is assigned at the i — th route rf\ The more big the ratio pf^ is, the 
more possibly the edges of route rf ■* are selected by ants at the next iteration step. That 
is, pf^ is a probability which will affect the route selection of ant at the next iteration step, 
jof ■* is called as pheromone probability, and FigH] diagrammatizes it. 



it) 








FIG. 1: The Schematic of pheromone probability pf\ (a) There is a complete graph with 



four vertices, two ants af^ and a^^' act on it at the t — th iteration step, (b) Two ants a^^' and 
Og*^ select two routes rf^ and rg*^ respectively. Every edge of route rf^ contains pheromone , f^^ 
represents the sum of pheromone on all edges of route rf\ As same, /g*'' represents the sum of 
pheromone on all edges of route r2^. (c) There are two routes rf^ and totally, pf^ represents 
the ratio of pheromone assigned at route rfK The more bigger ratio pf^ is, the more possibly the 
edges at route rf^ were selected by ants at the next iteration step. That is, pf^ is a probability 
which will affect the route selection of ants at the next iteration. 



,(*) 
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Definition 2 (Route Length Set): At the t — th iteration step, m ants select m routes. 
The set of route lengths is denoted as 

L_^etW = {L«,L«,...,Lf),...,LW} 

Definition 3 (Pheromone Probability Set): The set of pheromone probabilities is 
defined as 



P_^et« = {rf\ri^...,pf\...,pW} 
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B. Statistical Features of Route Length Set 



At the t — th iteration step, the i — th ant Oj selects route rf\ where i = 1,2, ...,m. 
And route rf^ has length Lf\ It's possible that two different ants select same route and 
they have same route length. Even it's possible that, two different ants select two different 
routes, but their lengths are same. Thus, for a given value of route length x, there are 
set At{x) = {j\L^j '' = x}, where j is the subscript of route Vj^K And set At{x) is the 
set of subscripts of routes which lengths are equal to a given value x. Let denote 
the number of elements of set At{x). Number |v4((x)| represents the frequency that the 
routes with length x being selected by ants. And real number ^^^^-^ is the approximation 
of probability that represents the degree of possibility that the routes with length x being 
selected by ants. 

Let 



hAx) 



\A{x) 



m 

, where Lj„j = min{Lf^} and Lg^p = max{Lf^}. 

Then ht{x) is the function of probability in theory, and the domain of function is extended 
to the set of possitive real numbers in general. 

To observer the statistical feature of route length set L_Set^^\ its histogram is plotted, 
which is the approximation of probability function ht{x). The method of plot is proposed 
as below: 

Firstly, a two-dimensional coordinate frame is constructed, the x — axis denotes the value 
of route length x, and the y — axis denotes probability ht{x). The x — axis is divided into 
equal intervals, and the size of each interval is denoted by 6. 

Secondly, calculate the approximation of probability ht{x) for every interval / : Suppose 
interval / has a counter c which initial value is set to zero (i.e., c = 0). When a length 
Lf^ falling into this interval (i.e., Lf' G /), let the counter add one (i.e., c ^ c+1). Then 
for arbitrary x E I, there is function value — . Function value — is the approximation of 
probability ht{x). Under the condition that the size of interval / becomes very small, we 
have ht(x) = —. 

The histogram of test data prl36 is shown at FigjJl Figl2] demonstrates that route length 
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set L_Set^^^ has some statistical features, and they are summarized as below. 

(1) The value of route length x is random data and has probability ht{x). The expectation 
and deviation of set L_Set^*^ exist. 

(2) Being big in the middle and small at both sides, that is the shape of probability 
function ht{x). It's a typical distribution feature. 

(3) With the increase of iteration step (i.e., t ^ t + 1 ), the distribution of set L_Set^^^ 
will become stable. That is, the sequence of probability functions {hi{x), h2{x), ht{x), ...} 
is convergent. 



Number of solutons 

200 p 



:ions in interral 




Length of solutions 



FIG. 2: The Histogram of the Route Length Set L.S'et^: X — axis represents the value of 
route length x,Y — axis represents the probability that the routes with length x being selected by 
ants . In this figure, the probability is replaced by frequency for direct-viewing. This figure shows 
that set L_Set^^^ has statistical feature. The distribution of set L_Set^^^ is convergent. Notice: 
The test data is prl36, number of cities and ants is 136 and 544 respectively. The x — axis is 
divided into equal interval with size 2183 {i.e. 5 = 2183 ). The histogram of at t — th iteration 
is shown here, where t = 1, 10,50, 100,500, 1000. The same features are also found in other test 
data, such as prl07, dl98, pr226, d493 and so on. All test data in this paper is downloaded from 
http://www.iwr.uniheidelberg.de/iwr/comopt/soft/TSPLIB95/TSPLIB.htmll 
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C. The Expectation and Deviation of Route Length Set 

Expectation and deviation are the two most essential characteristics of distribution, these 
two characteristics of set L_Set^^^ will be calculated in this section. The expectation and 
standard deviation of set L_Set^''^^ are denoted by L and cr*^*-' respectively in this paper. L 
and o"'^*) are defined as below: 

Definition 4 (the expectation of set L_Set^^^): 

L =-y2L? (7) 

i=l 

, where m is the number of ants. 

Definition 5 (the standard deviation of set L_Set^^^): 




-(2) _{t) 

Two sequences {L ,L ,...,L ,...} and {a^^\ a^'^\ a^^\ ...} are shown at Fig|3l The 

-it) 

subfigure (a) of Fig|3] shows that expectation L descends continuously and converges to a 
constant value. The subfigure (b) shows that all standard deviations fluctuate narrowly in 
an interval and most of deviations are close to a constant value. 

D. Holding View Point of Statistics to Understand AGO Convergence 

There are three types of understanding for the convergence of AGO: 
Type 1: With the increase of iteration steps, all ants will select the optimal route which 
has shortest length. 

Type 2: With the increase of iteration steps, all ants will select a unique fixed (or stable) 
route, but it is not optimal possibly. 

Type 3: With the increase of iteration steps, more than one fixed routes are selected by 
different ants. That is, AGO converges to a stable set which consists of some fixed routes, 
not a unique route. 

AGO converging to optimal route is difficult in general, the first type is not common 
in practice. The 2nd type is also not common in practice, and it never be observed in 
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(a) Evolution of average solution length (pr136) 



upper bound 



average 



lower bound 
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Iterat ions 



900 1000 



(b) Evolution of the standard deviation of the population's solution Iengths(pr136) 

FIG. 3: The Feature of Expectation and Standard Deviation of Route Length 

-(1) -(2) 

Set: Figure (a) and (b) diagrammatize the two curves of sequence {L ,L ,---,L ,...} and 

-W 

{a^^\a^'^\ ...,a^^\ ...} respectively, where L and cr*^*-' are the expectation and standard deviation 
of route length set L_Set^^^ respectively. Figure (a) shows that expectation descends continuously 
and converges to a constant value. Figure (b) shows that all standard deviations fluctuate nar- 
rowly in a small interval comparing with average route length and most of deviations are close to a 
constant value. The similar feature are observed from other instances, such as prl07, dl98, pr226, 
d493 and so on. Notice: The test data in this figure is prl36, the number of cities is n = 136, the 
number of ants is m = 544, and the number of maximum iteration is Nq^^^ = 1000. 



the authors' experiment. Instead of the 2nd type, the 3rd type is common in practice. 
For example, Figl2] shows that, there are always different routes selected by ants at every 
iteration step, and the convergent route is not unique. Since the 3rd type is common 
and has more practical worthiness, the convergence of AGO refers to this type 
in this paper. 

In addition, the aim of AGO is to find the shortest route length, and the difference of 
the routes is not cared. Therefore, a equivalent statement of the 3rd type is that, AGO 
converges to a stable set which consists of stable route lengths . 
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According to above discusion, if AGO converges, the stable set will appear, which consists 
of stable route lengths. Then the histogram of this stable set is convergent (see Fig. [2]). 
That is, AGO converging results in probability sequence {hi{x),h2{x), ...,ht{x), ...} being 
convergent. At the same time, sequence {hi{x), h2{x), ht{x), ...} being convergent will 
result in AGO converging also, and it is proved as below: 

Let set At{x) = {j\Lj^ = x}. Then ht{x) = Thus, if {hi{x),h2{x), ...,ht{x), ...} is 

convergent, becomes fixed (or stable). Since route rj*^ represents the route selected 

by ant aj, number |Aj(x)| represents the number of ants which routes has length x. There are 
only two factors to cause |At(x)| becoming fixed. One factor is AGO being convergent. The 
other factor is that, some ants coming into set At{x) and some coming out, the quantities 
of input and output are equal. The second factor is too special so that it does not exist. 
Therefore, if {hi{x), h2{x), ht{x), ...} is convergent, AGO will be convergent. 

According to above discussion, the following conclusion is obtained: 

Conclusion 1: AGO being convergent is equivalent to the sequence of probability func- 
tions {hi{x),h2{x), ...,ht{x), ...} being convergent. 

This conclusion shows that, the histogram of route length set becoming convergent is the 
marker of AGO being convergent (see Figj2]). 

IV. USING PHEROMONE PROBABILITY TO OBSERVE THE STATISTICAL 
FEATURE OF ROUTE LENGTH SET 

A. The Pseudo-Probability and Pseudo-Histogram of Route Length Set 

The definition of pseudo-probability h[{x): 

At the t — th iteration step, every ant will select a route. The i — th ant selects route 
rf \ and rf^ has length Lf\ where i = 1,2, ...,m. Each route rf^ contains the amount of 

pheromone f^^\ which is the sum of pheromone depositing on every edge of route rf\ The 

(t) &^ 

pheromone probability is the ratio of pheromone, it is defined as p\ = ^ ' . 

Set At{x) = {j\L^j^ = x} is the subscripts set of routes which length is a given value x. 
Basing on set At{x), pseudo-probability is defined as 
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j&At{x) 

, where Linf = min{Lf^} and Lg^p = max{Lf^}. 

Pseudo-probability h[{x) is the sum of pheromone probabihties which associated route 
has length x. 

The pseudo-histogram of route length set L_Set^^^: 

Pseudo-histogram is also a histogram in which pseudo-probability h[(x) replace probabil- 
ity ht{x) to estimate the distribution of route length set L_Set^^\ It is generated by following 
method: 

Firstly, a two-dimensional coordinate frame is constructed, the x — axis denotes the value 
of route length x, and the y — axis denotes pseudo-probability h[{x). The x — axis is divided 
into equal intervals, and the size of each interval is denoted by 6. 

Secondly, calculate the approximation of pseudo-probability for every interval /: Suppose 
X represents argument and x G /. A counter d is attached to interval /, and its initial value 
is set to zero (i.e., c? = 0;). If Lf^ falls into interval / (i.e., Lf^ G /), its associated pheromone 
probability pf' is add to d (i.e., d = d + pf''). The value d is the function value of argument 
X. When the size of interval / limits to zero ideally, the value d limits to pseudo-probability 
h't{x). 

The pseudo-histogram is shown in FigJH Comparing with the histogram of probability 
function shown at Fig 121 pseudo-histogram is very similar to it. Probability ht{x) repre- 
sents the degree of possibility that the routes with length x being selected by ants, pseudo- 
probability h[{x) represents the sum of pheromone probability pf^ which associated route 

^ has length x. The similarity of these two figures excites the guess that pseudo-probability 
is the approximation of probability(i.e., ht{x) ~ h[{x)). 

-'{t) 

B. Pseudo-Expectation L and Pseudo-Deviation a 

-'(t) 

Definition 6 (L , Pseudo-Expectation of Route Length Set L_Set^^^ Calculated by 
Pheromone Probability) : 
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FIG. 4: The Pseudo-Histogram of Route Length Set L_Set^*^ Calculated by Pheromone 

Probability pf' . The X — axis denotes the value of route length x and is divided into small 



intervals. The value a.i Y — axis denotes the pseudo-probability h[{x), which is the sum of all 
pheromone probabilities pf^ under the condition Lf^ = x. The input data of this instance is 
prl36, where the number of cities n = 136, number of ants m = 544 and length of interval 
6 = 2183. Comparing with Figurel2l this figure is similar to it. And this similarity excites the 
guess pseudo-probability h[{x) is the approximation of probability ht{x), which represents the 
degree of possibility that the routes with length x being selected by ants. 

Sum of pheromone probability 
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L = ^xh'tix) = 



(t) ^ (t) 
^ Pi 



(9) 



xeV i=l 

where x denotes the value of route length and V denotes the set of these values. 
Definition 7 (cr'*^*-*, Pseudo-Deviation Calculated by Pheromone Probability): 



a 



'(t) 



m _i{t) 

\ i=l 



(10) 



_'(!) _'(2) _'(t) 

The two sequence {L , L , L , ...} and sequence {a a .., a ...} are shown 

-W -'{*) 

at FigO Comparing with Fig J31 Fig JS] is very similar to it. Two sequences {|L —L |} and 

-W -'(*) 

{|(T*^*'' — o" *^*^|} are shown in FigEl this figure demonstrates that L » L and cr*^*-' ~ a 
Expectation and deviation are two most important characteristics of set of random data. 
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And Figl5] and FigjG] are two evidences to support the conclusion 

ht{x) ^ hf{x) 

, where ht{x) and h[{x) denotes the probabihty and pseudo-probabihty respectively. 



Pseudo-Expectation 
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(a) Evolution of pseudo-expectation about pheromone probability 
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(b) Evolution of pseudo-deviation about pheromone probability 
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FIG. 5: Pseudo-Expectation L and Pseudo-Deviation a'^^^ : Comparing with FiglSl this 
figure is similar to it. This similarity provides a evidence to support conclusion 2: ht{x) ~ h'^^x). 
Notice: The input data of this shown instance is prl36, the number of ants m = 544, the number 
of cities n = 136 and the number of maximum iteration Nc^_^_^^^ = 1000. The same conchision is 
also found in other test data, such as prl07, dl98, pr226, d493 and so on. 

-it) -'(t) 

Since histogram and pseudo-histogram is very similar and L ^ L and a^*-* ~ cr'^*-*, we 
have following conclusion: 

Conclusion 2: With the increasing of iteration step, pseudo-probability is the approxi- 
mation of probability (i.e., ht{x) ~ h[{x) when t oo). 

Conclusion 1 shows probability function ht{x) being convergent is equivalent to ACO 
being convergent. Since ht{x) ^ h[{x), we have 
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Fig. -6 A: Values I L<" - L*" I, I L'"' - L^'' I, ■ • ■ I L'" - ' 



are Close to Zero 



Fig. 6-B: Values I a'-" -a"''^ 1,1 cr'' 



'I,...,Ict"'-c7""L 



are Close to Zero 



-'W _'{t) 
FIG. 6: Sequence of Pesudo-Expectation {L } and Pesudo-Deviation {a } Is Close 

- (*) _(t) _ -W -'W 

to {L } and {a } Respectively. Two sequences {\L — L |} and {[o"*^*-* — c ^*^|} are shown 

-W -'(i) 

in this figure. This figure shows that L ~ L and This evidence further supports 

conclusion 2. Notice: The experiment parameters are same to Fig. 5 



Conclusion 3: Pseudo-probability }i^{x) being convergent is equivalent to AGO being 
convergent 

Pseudo-probability }i^{x) being convergent results in AGO being convergent. And when 
AGO being convergent, every route selected by ant is fixed. This situation results in the 
amount of pheromone depositing on convergent route is fixed and its ratio (i.e., pheromone 
probability pf^) is fixed too. Therefore, function fl^{x) being convergent results in pheromone 
probability pf^ being convergent, where i = 1,2,..., m. On the other hand, pheromone 
probability pf^ being convergent results in the function h'^{x) being convergent and AGO 
being convergent. Then, we have 



14 



Conclusion 4: Pseudo-probability h[{x) being convergent is equivalent to pheromone 
probability set PSet^*^ being convergent, where the convergence of P^Set^*'^ refers to that 
every pheromone probability in this set is convergent. 

Conclusion 5: Pheromone probability set PSet^^^ being convergent is equivalent to 
AGO being convergent. 



V. ENTROPY CONVERGENCE 



A. Entropy of Pheromone and Its Convergence 



In 1948 Shannon introduced the entropy 25|] into information theory for the first time. In 



information theory, entropy is a measure of the uncertainty associated with random system. 
The lower entropy is, the lower the uncertainty of system is. Entropy is defined as 

n 

H = -^Pi ■ log.pi (11) 
1=1 

, where Pi denotes the probability. 

At t — th iteration of AGO, ant select route rf\ where i = l,2,...,m. Route rf^ 
associates with pheromone probability pf\ which is the ratio of pheromone assigned at 
route rf\ All pheromone probability pf^ comprise set P_Set^^^ = {Pi\p2 \ ■■■,pf\ ■■■,Pm}- 

According to EqlTTl entropy of pheromone is defined as 

n 

H{P_Set^'^) = -J2p?-log,p? (12) 



1=1 



It is simplified as 



Ht = -Y.Pt^ ■ logjf (13) 

i=l 

Pheromone probability pf^ represents the ratio of pheromone assigned at route rf\ If 
every route is assigned equal amount of pheromone, all ants don't know which route is 
best and select route randomly. At this time, all pheromone probabilities are equal (i.e., 
pf^ = ^), entropy of pheromone is maximum, and the uncertainty degree that ants selecting 
route is maximum. And this situation is often happened at the early iteration steps of 
AGO. If pheromone is assigned at few routes, most of ants will select these routes with high 
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probability. At this time, there is low uncertainty for ants selecting route, entropy is small. 
This situation is often happened at the iteration steps at which AGO is close to convergence. 

With the increase of iteration step, every route selected by ants will become fixed, the 
pheromone depositing on it becomes fixed (stable) and its pheromone probability becoming 
fixed too. This situation results in the sequence {Hi, H2, Hf, ...} converging. The test 
result at Fig J7| shows that the entropy sequence is convergent. 



Entropy 
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4.6724 - ; I'l 1 fj 



Entropy EvolutioD of entropy of ACO algorithm solving pr107 
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4.9122 - 



Evolution of entropy of ACO algorithm solving pr136 

FIG. 7: Entropy of Pheromone Is Convergent. Two curves show that two entropy sequences 
are convergent approximately. And the amplitude of swing is very narrow, it is less than 0.0002/4.6 
or 0.0004/4.9 . Notice: The shown test data are prl07 and prl36, the number of ant is equal to 
the number of cites (i.e. m = 107 and m = 136), the number of maximum iteration is Nq^^^ = 500 
and the feature of entropy convergence is observed from other test data also, such as dl98, pr226, 
d493 and so on. The entropy is calculated by H{t) = — X^pf^ • Inpf^ in this figure. 

i=l 



B. Entropy Convergence Is A Marker of ACO Convergence 

Entropy is the most essential characteristics of a random system. Thus, the convergence 
of entropy sequence {Hi, H2, Hf, ...} is the marker of the convergence of set P_Set^*\ Set 
being convergent is equivalent to ACO being convergent according to conclusion 
5. Therefore, the convergence of entropy sequence {Hi, H2, Ht, ...} is the marker of the 
convergence of ACO. When ACO is convergent, set P_Set^^^ is convergent, entropy sequence 
is convergent too. If ACO is not converg ent, set P_Set^^^ is not convergent, entropy sequence 
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is not convergent too. On the other hand, when entropy sequence is converg ent, set P_Set^^^ 
is convergent very possibly because entropy is its essential characteristic, AGO is convergent 
too. If entropy sequence is not convergent, set PSet^^^ is not convergent very possibly, AGO 
is not convergent too. 

Therefore, the convergence of entropy sequence is a marker of minimum iteration steps 
at which ACO is convergent. 

In addition, the convergence of entropy sequence {Hi, H2, Ht, ...} has usual criterion 



26|. And criterion ^ < £ is a very simple criterion to estimate the 



\H,_H^ < e 

minimum iteration number at which AGO is convergent possibly. 



VI. APPLICATION OF ENTROPY CONVERGENCE 

A. Apply Entropy Convergence as Termination Criterion of ACO 

The improved AGO algorithm with criterion < £ is presented as below, and it 

is named ACO-Entropy in this paper. 
Algorithm ACO-Entropy 

Stepl. Initialize pheromone trails for all edges and put m ants at different cities. Let 
t = , Avij (0) = and Hq = log2 m. 
Step2. do 

{ 

Step2.1 t^t + 1. 

Step2.2 The ants choose next cities according to transition probability. 
Step2.3 After all ants finish their travels, pheromone are updated. 
Step2.4 The pheromone probability pf^ and the entropy Ht are calculated by 
formula ([6]) and (fT3|) respectively. 
}while(^^^^ > e) 

Step3. End procedure and output result. 



B. The Experiment and Comparison 



All data tested in this paper are downloaded from ht t p : / / www . i wr . uniheidelberg . de /i wr / 
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comopt/soft/TSPLIB95/TSPLIB.html. All algorithms in this paper run on personal 
computer, CPU (2): 1.60GHZ, Memory: 480M, Software: Matlab 7.1. All parameters are 
set as below: 



a = 1, /3 = 8, p = 0.4, Q = 100, 7:^(0) = 1, m = n, e = 0.001, 



1000. 



To test the performance of ACO-Entropy, two algorithms of AGO and the ACO-Entropy 
are tested in this paper, where AGO refers to Ant-Gycle shown at section II, which is often 
used standard algorithm. 

Table. 1 and FigE] show that, AGO-Entropy is faster than AGO by factors of 2-6 under 
the same condition and nearly same quality of solution is obtained. 



I ACQ 

I ACO-Entropy 



Ratio=time (ACO)/time (algorithm) 



it 



prl07 prl36 dl98 pr226 



1 1 

2.6459 3.8124 



1 

6. 3309 



d493 



1 1_ 

1.7079 3.2311 



FIG. 8: Comparison of the Running Speed of AGO and ACO-Entropy. ACO-Entropy is 

faster than AGO by factors of 2-6 under the same condition and the nearly same quality of solution 
is obtained (see Table 1), wliere AGO refers to Ant-Cycle shown at section II. 
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Input 


Number 


ACO-Entropy 


Data 


of Test 


AvoTMPo SolntioTi 




Tt prat inn Nninlipr 


prl07 


10 


46294 


163 0804 


189 


prl36 


10 


108467 


173.3620 


131 


dl98 


10 


17135 


447.4071 


100 


pr226 


10 


84718 


2466.1 


293 


d493 


2 


39851 


16405 


155 




ACO 


prior 


10 


45973 


431.496 


500 


prl36 


10 


102608 


660.918 


500 


dl98 


10 


16891 


2832.5 


500 


pr226 


10 


84514 


4211.8 


500 


d493 


2 


38926 


53007 


200 


Tabid. Performance Comparison of AGO and ACO-Entropy: 

This table shows that ACO-Entropy is faster than ACO by factors of 2-6. 

The two solution qualities of ACO-Entropy and ACO are nearly same. 



VII. CONCLUSION 

The convergence of ACO is the base of ACO, its study is not much currently. The 
convergence under some special conditions has be studied, and the view point of study are 
Graph theory, Markov process, and so on. It is interesting to find a new view point to study 

ACO convergence under general condition. The aim of this paper is to explore new view 
point of studying ACO convergence under general condition and to find the new marker of 
ACO convergence. 

Since ACO is kind of probabilistic algorithm, the feature of its convergence possibly hide 
in some statistical properties. Thus, the analysis of statistical property is the start point of 
study of this paper. Along this start point, five equivalent statements of ACO convergence 
are found in this paper (see Confusion 1-5). And these equivalent statements result in the 
following conclusion: 

ACO may not converges to the optimal solution in practice, but its entropy is convergent 
under general condition. 
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